Physica A: Statistical Mechanics and its Applications
○ Elsevier BV
Preprints posted in the last 30 days, ranked by how well they match Physica A: Statistical Mechanics and its Applications's content profile, based on 10 papers previously published here. The average preprint has a 0.01% match score for this journal, so anything above that is already an above-average fit.
Pachter, L.
Show abstract
We introduce a spectral existence criterion for the evolution of cooperation in the form of the inequality{lambda} maxb > c, where{lambda} max is the leading eigenvalue of an interaction operator encoding population structure, and b and c represent benefit and cost tradeoffs, respectively. Nowaks five rules for the evolution of cooperation correspond to cases in which the cooperation condition reduces to a scalar assortment coefficient. These results follow from the Price equation, which sheds light on a long-standing debate on the role of inclusive fitness and evolutionary dynamics in explaining the evolution of cooperation.
Sukekawa, T.; Ei, S.-I.
Show abstract
Mass-conserved reaction-diffusion systems are used as mathematical models for various phenomena such as cell polarity. Numerical simulations of this system present transient dynamics in which multiple stripe patterns converge to spatially monotonic patterns. Previous studies indicated that the transient dynamics are driven by a mass conservation law and by variations in the amount of substance contained in each pattern, which we refer to as "pattern flux". However, it is challenging to mathematically investigate these pattern dynamics. In this study, we introduce a reaction-diffusion compartment model to investigate the pattern dynamics in view of the conservation law and the pattern flux. This model is defined on multiple intervals (compartments), and diffusive couplings are imposed on each boundary of the compartments. Corresponding to the transient dynamics in the original system, we consider the dynamics around stripe patterns in the compartment model. We derive ordinary differential equations describing the pattern dynamics of the compartment model and analyze the existence and stability of equilibria for the reduced ODE with respect to the boundary parameters. For a specific parameter setting, we obtained results consistent with previous studies. Moreover, we present that the stripe patterns in the compartment model are potentially stabilized by changing the parameter, which is not observed in the original system. We expect that the methodology developed in this paper is extendable to various directions, such as membrane-induced pattern control.
Kuznetsov, A. V.
Show abstract
Alzheimers disease (AD) is characterized by the accumulation of amyloid-{beta} (A{beta}), yet the specific link between plaque burden and cognitive decline remains a subject of intense investigation. This paper presents a mathematical model that simulates the coupled dynamics of A{beta} monomers, soluble oligomers, and fibrillar species in the brain tissue. By modifying existing moment equations to include a dedicated conservation equation for A{beta} monomers, the model explores how various microscopic processes, such as primary nucleation, surface-catalyzed secondary nucleation, fibril elongation, and fragmentation, contribute to macroscopic disease progression. Central to this study is the concept of "accumulated neurotoxicity" as a surrogate marker of biological age, defined as the time-integrated concentration of soluble A{beta} oligomers. Unlike plaque burden, accumulated neurotoxicity cannot be reversed, and the harm it causes depends critically on the sequence of events that produced it. Numerical results demonstrate that while plaque burden and neurotoxicity both increase over time, their relationship is non-linear and highly sensitive to the efficiency of protein degradation machinery. Specifically, impaired degradation leads to a rapid advancement of biological age relative to calendar age. The model further identifies oligomer dissociation and fibril fragmentation as potential protective mechanisms that can counterintuitively reduce neurotoxic burden by diverting monomers away from the soluble oligomer pool. These findings provide a quantitative framework for understanding why individuals with similar plaque burdens may experience vastly different cognitive outcomes, underscoring the importance of targeting soluble oligomers early in therapeutic interventions.
Ahammed, F.
Show abstract
Fraud in the health landscape is an aggravating issue, with far-reaching consequences burdening the financial stability of the health industry and threatening the quality of medical care. It results from vulnerabilities within the current healthcare framework that are exploited by the fraudsters in their favor. In spite of many developed models that aim to detect fraudulent patterns in insurance claims, the accuracy of such models frequently suffers as a result of the imbalance issue of the Medicare dataset and irrelevant features. This study ventures to improve detection performance and accuracy by employing a deep learning model along with data sampling and feature selection techniques. Comparative analysis among different combinations is conducted to determine their efficacy to enhance the accuracy of the fraud detection model. Hence, the suggested model clearly demonstrates that a combination of myriad data sampling and feature selection techniques is helping to improve accuracy and performance. The accuracy was thus 95.4%, with negligible evidence of overfitting detected using both Chi-square and Synthetic Minority Over-sampling (SMOTE) techniques. Ultimately, the study findings underscore the significance of employing combined techniques instead of using only the baseline deep learning model for better performance in detecting Medicare insurance fraud.
Khan, H.; Garcia-Galindo, P.; Ahnert, S. E.; Dingle, K.
Show abstract
A morphospace is an abstract space of theoretically possible biological traits, shapes, or property values. It is interesting to explore which parts of a morphospace life occupies, as compared to those parts which could be occupied, but are not. Comparing random and natural non-coding (nc) RNA secondary structures is an established approach to studying morphospace occupation for RNA structures. Most earlier studies have focused on the minimum free energy (MFE) structure, while relatively few have looked at the Boltzmann distribution, describing the ensemble of energetically suboptimal RNA folds. These suboptimal structures may have important roles and functions, and hence should be examined carefully. Here we compare random and natural ncRNA in terms of their Boltzmann distributions, finding that natural RNA tend to have very similar profiles to random RNA, with the main difference being that natural RNA are slightly more energetically stable, except for very short sequences (20 to 30 nucleotides) which tend to be slightly less stable. We infer that natural ncRNA occupy similar parts of the morphospace that random RNA do, indicating that the biophysics of the genotype-phenotype map largely determines the ensemble properties of ncRNA.
Hernandez Vargas, E. A.
Show abstract
Evolutionary therapies regulate heterogeneous populations by altering selective pressures through treatment sequences in cancer and infections. This letter develops an invariant-set framework for treatment-induced containment based on positive triangular invariant sets. For periodically switched systems, sufficient conditions are derived for the existence of such invariant regions. Robustness with respect to mutation is established by showing that the invariant simplex persists under small perturbations of the subsystem matrices. In the two-phenotype case, the analysis yields an explicit mutation threshold that separates regimes in which therapy cycling maintains containment from regimes in which mutation can enable evolutionary escape. Simulations illustrate the geometry of the invariant sets and the role of mutation and dwell time in containment robustness.
Vasylenko, L.; Livnat, A.
Show abstract
At the fundamental conceptual level, two alternatives have traditionally been considered for how mutations arise and how evolution happens: 1) random mutation and natural selection, and 2) Lamarckism. Recently, the theory of Interaction-based Evolution (IBE) has been proposed, according to which mutations are neither random nor Lamarckian, but are influenced by information accumulating internally in the genome over generations. Based on the estimation-of-distribution algorithms framework, we present a simulation model that demonstrates nonrandom, non-Lamarckian mutation concretely while capturing indirectly several aspects of IBE: selection, recombination, and nonrandom, non-Lamarckian mutation interact in a complementary fashion; evolution is driven by the interaction of parsimony and fit; and random bits do not directly encode improvement but enable generalization by the manner in which they connect with the rest of the evolutionary process. Connections are drawn to Darwins observations that changed conditions increase the rate of production of heritable variation; to the causes of bell-shaped distributions of traits and how these distributions respond to selection; and to computational learning theory, where analogizing evolution to learning in accord with IBE casts individuals as examples and places the learned hypothesis at the population level. The model highlights the importance of incorporating internal integration of information through heritable change in both evolutionary theory and evolutionary computation.
Frost, H. R.
Show abstract
We describe an approach for analyzing biological networks using rows of the Krylov subspace of the adjacency matrix. Specifically, we explore the scenario where the Krylov subspace matrix is computed via power iteration using a non-random and potentially non-uniform initial vector that captures a specific biological state or perturbation. In this case, the rows the Krylov subspace matrix (i.e., Krylov trajectories) carry important functional information about the network nodes in the biological context represented by the initial vector. We demonstrate the utility of this approach for community detection and perturbation analysis using the C. Elegans neural network.
Yang, F.; Hanks, E. M.; Conway, J. M.; Bjornstad, O. N.; Thanh, N. T. L.; Boni, M. F.; Servadio, J. L.
Show abstract
Infectious disease surveillance systems in tropical countries show that respiratory disease incidence generally manifests as year-round activity with weak fluctuations and irregular seasonality. Previously, using a ten-year time series of influenza-like illness (ILI) collected from outpatient clinics in Ho Chi Minh City (HCMC), Vietnam, we found a combination of nonannual and annual signals driving these dynamics, but with unknown mechanisms. In this study, we use seven stochastic dynamical models incorporating humidity, temperature, and school term to investigate plausible mechanisms behind these annual and nonannual incidence trends. We use iterated filtering to fit the models and evaluate the models by comparing how well they replicate the combination of annual and nonannual signals. We find that a model including specific humidity, temperature, and school term best fits our observed data from HCMC and partially reproduces the irregular seasonality. The estimated effects from specific humidity and temperature on transmission are nonlinearly negative but weak. School dismissal is associated with decreased transmission, but also with low magnitude. Under these weak external drivers, we hypothesize that stochasticity makes a strong sub-annual cycle more likely to be observed in ILI disease dynamics. Our study shows a possible mechanism for respiratory disease dynamics in the tropics. When the external drivers are weak, the seasonality of respiratory disease dynamics is prone to the influence of stochasticity.
Hunter, K. E.; Martin, N. S.
Show abstract
Evolving populations, especially in the strong-selection-weak-mutation limit, can be modelled as adaptive walks on fitness landscapes, moving in fitness-increasing mutational steps until reaching a fitness peak--a local optimum. Simulations of such adaptive walks--on a multi-peaked empirical landscape of the folA gene and on landscapes generated by the Rough Mount Fuji (RMF) model-- have shown that some landscapes are highly navigable, meaning that the highest x% of peaks are reached by >> x% of adaptive walks. This prompts the question of how adaptive walks can be so successful despite the local, myopic rules behind each adaptive step. Here, we investigate this question using simulations and mathematical approximations of random adaptive walks on a simplified RMF landscape. The landscape has a low-to-intermediate fitness region, whose size reconciles a low peak density with a high peak number. Despite the high number of peaks, walkers are likely to exit this region without terminating at a peak because the probability of a peak transition at each step is low and a fitness gradient guides walkers to the high-fitness region in few steps. Thus, three features are sufficient to explain why adaptive walks in the simplified RMF landscape are likely to reach a small fraction of top-ranking peaks: a low-to-intermediate fitness region with a high number of peaks, a low peak-transition probability, and which is crossed in few steps. We find that these three features are also present in the empirical folA landscape, suggesting that similar principles may apply.
CHAKRABORTY, A.; Das, S.; Phyo, M.
Show abstract
Introduction: Understanding the factors influencing perceptions of cancer-related information is crucial for improving public health communication. This study explores the association between perceived difficulty in understanding information related to cancer (Cancer info Hard to Understand) and concerns about the quality of cancer-related information (Concern about Cancer Info Quality) with the extent of difficulty in comprehending medical statistics information (Understanding Medical Statistics). Methods: Data came from the 2022 Health Information National Trends Survey (HINTS). The cross-sectional study included 1972 participants with a response rate of 67.36% for Cancer info Hard to Understand, and 65.31% for Concern about Cancer Info Quality. We investigated the effect of Understanding Medical Statistics on Cancer info Hard to Understand, and Concern about Cancer Info Quality using univariate and multivariable logistic regression models with survey weights. The multivariable logistic regression model was adjusted for age, gender, ethnicity, marital status, education level, employment history, confidence in internet health resources, and social media. The chi-square test was used to measure the association between the predictors and the outcome. Results: Individuals finding medical statistics hard to understand were more likely to be concerned regarding the quality of the cancer-related information (AOR=1.74, 95% CI: [1.20, 2.52]) and also found cancer-related information difficult to comprehend (AOR=1.89, 95% CI: [1.19, 3.00]). Also, the influence of social media on health information seeking was significantly associated with Concern about Cancer Info Quality (AOR=2.24; 95% CI: [1.33, 3.76]), and Cancer info Hard to Understand (AOR=2.84; 95% CI: [1.61, 5.03]). Conclusion: This study highlights the critical role of understanding medical statistics in shaping perceptions of cancer-related information. From an epidemiological perspective, enhancing statistical literacy is essential for making informed health decisions, addressing health disparities, and designing effective, targeted cancer communication strategies.
Kipkoech, G.; Kanda, W.; Irungu, B.; Nyangi, M.; Kimani, C.; Nyangacha, R.; Keter, L.; Atieno, D.; Gathirwa, J.; Kigondu, E.; Murungi, E.
Show abstract
Malaria is one of the deadliest diseases in sub-Saharan Africa and Southeast Asia. The majority of the fatalities occur mostly in children under 5 years and pregnant women and this is due to infection by Plasmodium spp, of which Plasmodium falciparum is the most virulent and is responsible for most of the morbidity and mortality. Despite various public health interventions such as use of insecticide-treated bed nets, spraying of homes with insecticides and use of WHO recommended artemisinin-based combination therapies (ACT), malaria prevention still faces major setback due to drug and insecticide resistance by P. falciparum and mosquitoes respectively. The study uses molecular docking and immunoinformatics to screen various Plasmodium spp antigens and evaluate their antigenicity and suitability as vaccine candidates. The P. falciparum antigens and T-cell receptor (TCR) structures were obtained from Protein Data Bank (PDB) based on a range of factors related to their role in the lifecycle of the parasite and their status as vaccine targets. Protein structures not available in the PDB were predicted using AlphaFold. The 3D structures of selected P. falciparum antigens and TCR structures were downloaded in PDB format then all water molecules, Hetatm, and bound ligands were deleted from the protein structures using BIOVIA Discovery Studio Visualizer. Subsequently, molecular docking was done using ClusPro v2.0 server and docked complexes were compared. The findings of this study gave valuable insights into the interaction of human immune response with P. falciparum antigens. The best three ranked antigen complexes are PfCyRPA, PfMSP10 and PfCSP and this confirm their use as potential candidates for vaccine development. This study highlights the usefulness of computational docking in identifying P. falciparum antigens of excellent immunogenic potential as vaccine candidates.
Peimankar, A.; Hossein Motlagh, N.; K. Khare, S.; Spicher, N.; Dominguez, H.; Abolghasemi, V.; Fujiwara, K.; Teichmann, D.; Rahmani, R.; Puthusserypady, S.
Show abstract
Background: Atrial fibrillation (AFib) is the most common sustained arrhythmia in the world, imposing a heavy clinical and economic burden on global healthcare systems. Early detection of AFib can reduce mortality and morbidity, while helping to alleviate the growing economic burden of cardiovascular diseases. With the increasing availability of digital health technologies, computational solutions have great potential to support the timely diagnosis of cardiac abnormalities. Objectives: With the increasing availability of electrocardiogram (ECG) data from clinical and wearable devices, manual interpretation has become impractical due to its time-consuming and subjective nature. Existing automated approaches often rely on single classifiers or fixed ensembles that primarily optimize predictive accuracy while neglecting model diversity, which leads to limited robustness and generalization across heterogeneous datasets. Therefore, this study aims to develop a robust and diversity-aware framework for automatic AFib detection that simultaneously improves classification performance and model generalizability. To this end, we propose MOE-ECG, a multi-objective ensemble selection and fusion framework that explicitly optimizes both predictive performance and inter-model diversity for reliable AFib detection from ECG recordings. Methods: The proposed multi-objective ensemble (MOE) framework uses ensemble selection as a bi-objective optimization problem and employs multi-objective particle swarm optimization to identify complementary classifiers from a heterogeneous model pool. Unlike conventional ensembles, it explicitly optimizes both predictive performance and diversity and integrates Dempster-Shafer theory for uncertainty-aware decision fusion. After filtering the ECG signals to remove baseline wander and noise, they were segmented into windows of 20, 60, and 120 heartbeats with 50% overlap. The proposed approach was evaluated over five independent runs to assess its stability and generalization. Fifteen statistical and nonlinear features were obtained from the RR-intervals of the pre-processed ECG signals, of which eight features were selected with correlation analysis to capture subtle information from the ECG data. We trained and evaluated the performance of the proposed model in three open source databases, namely, the MIT-BIH Atrial Fibrillation Database, Saitama Heart Database Atrial Fibrillation, and Long-Term AF Database. Results: The proposed approach achieved the best overall performance on 60-beat segments, with an average accuracy of 89.85%, precision of 91.14%, recall of 94.19%, an F1-score of 92.64%, and area under the curve (AUC) of around 0.95. Statistical analysis using Holm-adjusted Wilcoxon tests confirmed significant improvements (p<0.05) compared to both the best individual classifier and the unoptimized average ensemble of all classifiers. These findings show that the proposed selection and evaluation methodology, rather than group aggregation alone, is the key driver of performance improvements. Conclusion: The results obtained demonstrate that the MOE-ECG model offers a robust, accurate, and reliable solution for the detection of AFib from short ECG segments. The empirical findings, in general, confirm that multi-objective ensemble fusion enhances diagnostic performance and offers robust predictions that will open up possibilities for real-time AFib detection in clinical and tele-health settings.
Chattaraj, A.; Kanovich, D. S.; Ranganathan, S.; Shakhnovich, E. I.
Show abstract
Phase separated condensates are recognized as a ubiquitous mechanism of spatial organization in cell biology. Biophysical modeling of condensates provides critical insights into the dynamics and functions of these subcellular structures that are difficult to extract via experiments. Here we present an efficient computational pipeline, CASPULE (Condensate Analysis of Sticker Spacer Polymers Using the LAMMPS Engine), to simulate and analyze the biological condensates made of sticker-spacer polymers. CASPULE implements a unique force field that combines traditional Langevin dynamics with a "detailed balance proof" protocol for single-valent bond formation between stickers. This framework allows us to study the non-trivial biophysics that emerge out of the single-valent sticker interactions coupled with the effect of separation in energetic contribution by stickers and spacers. We provide detailed documentation on how to setup the simulation environment, perform simulations and analyze the results. Through case studies, we highlight the utility and efficacy of our pipeline. Importantly, we provide statistical parameters to characterize the cluster size distribution often observed in biological systems. We envision this tool to be broadly useful in decoding the interplay of kinetics and thermodynamics underlying the formation and function of biological condensates.
Li, L.; Pohl, L.; Hutloff, A.; Niethammer, B.; Thurley, K.
Show abstract
Cytokine-mediated communication is a central mechanism by which immune cells coordinate activation, differentiation and proliferation. While mechanistic reaction-diffusion models provide detailed descriptions of cytokine secretion and uptake at the cellular scale, their computational cost limits their applicability to large and densely packed cell populations. Previously employed approximations of cytokine diffusion fields rely on assumptions that neglect the influence of cellular geometry and volume exclusion. In this work, we study a macroscopic description of cytokine diffusion and reaction dynamics based on homogenization techniques, rigorously linking microscopic reaction-diffusion formulations to effective continuum models. The resulting homogenized equations replace discrete responder cells with a continuous density, while retaining essential features of cellular uptake and excluded-volume effects. Further, we show that in regimes with approximate radial symmetry, classical Yukawa-type solutions emerge as limiting cases of the homogenized model, provided appropriate correction factors are included. Overall, our approach allows efficient multiscale modeling of cytokine signaling in complex immune-cell environments.
Dvoriashyna, M.; Zwanenburg, J. J. M.; Goriely, A.
Show abstract
Cerebrospinal fluid (CSF) is a Newtonian fluid that bathes the brain and spinal cord and oscillates in response to the physiological periodic changes in brain volume, of which the cardiac cycle is a major driver. Understanding this motion is essential for clarifying its contribution to solute transport, waste clearance, and drug delivery. In this work, we study oscillatory and steady streaming flow in the cranial subarachnoid space using a lubrication-based theoretical framework. The model represents the cranial CSF compartment as a thin fluid layer bounded internally by the brain surface and externally by the dura, driven by time-dependent brain surface displacements. We first derive simplified governing equations for flow over an arbitrary smooth sphere-like brain surface and obtain analytical solutions for an idealised spherical geometry with uniform displacements. We then incorporate realistic displacement fields reconstructed from MRI measurements in healthy subjects and solve the reduced equations numerically. The results show that oscillatory forcing produces a steady streaming component that may enhance solute transport compared with diffusion alone. This work provides a mechanistic description of the flow generated by physiological brain motion and highlights the potential presence of steady streaming in cranial subarachnoid fluid dynamics.
Garg, A.; Gielnik, M. B.; Kjaergaard, M.
Show abstract
Proteins with intrinsically disordered regions (IDRs) migrate at a higher apparent molecular weight in sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) complicating their analysis and identification. Here, we investigate the sequence determinants of the hypomobility of IDRs using a series of synthetic low complexity domains. We find that negative charge increases the apparent molecular weight, but neutral polar tracts also have abnormally slow migration. Positive charge and hydrophobic residues decrease the apparent molecular weight, although lysine residues show a biphasic effect with decreased migration at high fractional contents. Combinations of residues show that different sequence contributions to the apparent molecular weight are not additive. The results can be rationalized by the protein-decorated micelle model by considering both SDS binding and the compaction of protein SDS-complexes.
Brulhart, D.; Magini, G.; Schafer, A.; Schwab, S.; Held, U.
Show abstract
Objectives: Clinical prediction models estimate the risk of a future outcome in patients. Such models are often externally validated using independent datasets; however, even when a model has been rigorously validated in a new setting and patient population, its performance across other clinical settings remains unclear. Therefore, we systematically evaluated model performance and clinical utility across diverse patient populations to quantify the limits of transportability. Methods: Using liver transplantation as an example, we used the UK donation-after-circulatory-death (DCD) risk score and descriptive statistics from Swiss DCD liver transplant populations to simulate realistic target populations with varying donor and recipient characteristics. The risk score's ability to predict one-year graft failure was evaluated using calibration intercept, calibration slope, area under the receiver operating characteristic (ROC) curve, and net benefit. Results: The UK DCD Risk Score's performance depended heavily on the simulated population characteristics. While the score performed adequately in settings similar to those where it was derived, it was not satisfactory in others. Discussion: The study showed, using a risk score in liver transplantation as an example, that the application of a prediction model can be limited in certain external populations when they differ, and that its transportability in new settings is not guaranteed. Conclusion: This study highlights the importance of external validation of clinical prediction models to determine transportability to various target populations. Their application requires careful consideration and potential model re-estimation.
Grigas, A. T.; Sumner, J.; O'Hern, C. S.
Show abstract
Protein structure is controlled by a high-dimensional energy landscape, which is a function of all of the atomic coordinates of the protein. Can this landscape be accurately described by a low-dimensional representation? We find that residue core identity, a binary N-dimensional encoding indicating whether each of the N amino acids in a protein is buried in the core or not, can predict the proteins backbone conformation more efficiently than all other representations that we tested. Core identity is 4 times more efficient than previous estimates of the bits per residue needed to encode a proteins native fold, 2 times more efficient than the C contact map, and 1.5 times more efficient than the machine-learned embeddings from FoldSeeks 3Di. Even when the folded structure is unavailable, predicting each residues burial from sequence yields a more accurate estimate of fold quality than predicting pairwise contacts from the same sequence information. Thus, this work emphasizes that the problem of determining a proteins native fold can be re-framed as predicting each residues core identity.
Teshirogi, Y.; Terada, T.
Show abstract
Molecular dynamics (MD) simulations are a powerful tool for investigating biomolecular dynamics underlying biological functions. However, the accessible spatiotemporal scales of conventional all-atom simulations remain limited by high computational costs. Coarse-graining reduces these costs by decreasing the number of interaction sites and enabling longer timesteps. In extreme cases, proteins are represented as single spherical particles; while such approximations facilitate cellular-scale simulations, they often sacrifice essential structural information, such as molecular shape and interaction anisotropy. Here, we present CGRig, a rigid-body protein model with residue-level interaction sites designed for long-time, large-scale simulations. In CGRig, each protein is treated as a single rigid-body embedding residue-level interaction sites. Its translational and rotational motions are described by the overdamped Langevin equation incorporating a shape-dependent friction matrix. Intermolecular interactions are calculated using G[o]-like native contact potentials, Debye-Huckel electrostatics, and volume exclusion. We validated that CGRig accurately reproduces the translational and rotational diffusion coefficients expected from the friction matrix for an isolated protein. For dimeric systems, the model successfully maintained native complex structures. Furthermore, two initially separated proteins converged into the correct complex with an association rate consistent with all-atom simulations. Notably, CGRig achieved a simulation performance exceeding 17 s/day for a 1,024-molecule system. These results demonstrate that CGRig provides an efficient framework for simulating protein assembly while retaining residue-level interaction specificity, making it a valuable tool for investigating large-scale biomolecular self-assembly.